For a while now, I've wanted to play around with visualizing data on the books I've read over the last few years, particularly to discover what genres or themes I lean more toward, and to see how my reading habits have changed over time. Recently, two things happened that made me decide to act on this curiosity.
The process of collecting and organizing the data was more onerous than I anticipated. Initially, I imported my data from from Goodreads to StoryGraph, but the import was rather messy. I had to go through each record on StoryGraph to ensure its accuracy and then export the updated data for my visualization purposes. Unfortunately, StoryGraph's supplementary categories which I was especially excited about (book genre, mood, pace, etc.) are proprietary and were therefore excluded from the export. [2] I figured I could then just use whatever categories StoryGraph allowed me to export, but I soon realized that my struggles with data were far from over. I kept finding new problems and discrepancies that I hadn't seen coming. Every time I fixed one, another popped up, and it became a seemingly never-ending cycle that spanned over 4 days:
After struggling through the tedious process of organizing and cleaning the data, and feeling like I had achieved a decent level of accuracy (not perfect, but good enough), I finally got to the fun part: visualizing the data. I began with browsing the net for some inspiration, and am particularly grateful to:
Now, take a look at the trends and insights from my last 10 years in books below:
You can interact with the full story on Tableau Public here.
Overall, while my initial plans for the visualizations were more ambitious than what the data allowed, I'm still pleased about the intermediate deliverable I ended up with. Who knows what version 3, or 4, or 5 of this will look like in the future when I have access to more sophisticated data!
[1] For example, two of the most impactful non-fiction books I read last year were Four Thousand Weeks and The Pathless Path. I read the first at a particularly tumultuous point in my life when I was chronically busy and overwhelmed with all the things I wanted or needed to do. It was precisely the right time for me to step back engage with Oliver Burkeman's ideas about re-thinking our relationship to time. And a few months afterwards I read The Pathless Path around when I was thinking of leaving my job to go on a sabbatical. That too felt like exactly the right time to read this book. Reading about Paul Millerd's journey gave me the confidence/permission-slip to step off the treadmill and carve out a bold new path. Contrast this with Americanah, a book I've read twice. First, in my sophomore year in college when the themes around race and identity felt especially relevant to my own experience as a new immigrant in the U.S. When I re-read it in early 2021, it didn't quite hit the mark.
[2] Below is a chart that breaks down the books I've read by genre since 2014, which illustrates some of the additional data categories available on StoryGraph. I was a little surprised (okay, maybe even a tad embarrassed) to see the "self help" genre up there in the top 3, with a total of 14 books, most of which I've read in the past two years. But I suppose there’s nothing wrong with a little self-improvement, after all. Some of the books in this category have really transformed how I approach life, like The Pathless Path, Four Thousand Weeks, Focusing, Creativity: A Short and Cheerful guide, and Steal Like an Artist.
[3] Ever since I discovered Ferrante through the My Brilliant Friend series, I’ve been truly enamoured by her writing, and I unreservedly recommend her books to anyone interested in sharp, candid, piercing exploration of human relationships.